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^ . ABSTRACT 

' Higher Criticism is a recently developed statistic for non-Gaussian detection, 

' proposed in Donoho & Jin 2004, where it has been shown to be effective at 

lo ! 

, resolving a very subtle testing problem: whether n normal means are all zero 

. versus the alternative that a small fraction is nonzero. Higher Criticism is also 

I ' useful in the detection of a non-Gaussian convolution component of cosmic 
O 

Y-{ I strings in the Cosmic Microwave Background (CMB), see Jin et al. 2004. 

, In this paper, we study how well the anisotropics of the CMB fit with the 

homogeneous and isotropic Gaussian distribution predicted by the Standard 
• 1—1 . 

' Inflationary model. We find that Higher Criticism is useful for two purposes. 

H ' 

' First, Higher Criticism has competitive detection power, and non-Gaussianity 

is detected at the level 99% in the first year WMAP data. We generated 5000 
Monte Carlo Gaussian simulations of the CMB maps. By applying the Higher 
Criticism to all of these maps in wavelet space, we constructed confidence 
regions of Higher Criticism at levels 68%, 95%, and 99%. We find that the 
Higher Criticism value of WMAP is outside the 99% confidence region at 
a wavelet scale of 5 degrees (99.46% of Higher Criticism values based on 
simulated maps are below the values for WMAP). Second, Higher Criticism 
offers a way to locate a small portion of data that accounts for the non- 
Gaussianity. This property is not immediately available for other statistical 
tests such as the widely-used excess kurtosis test. Using Higher Criticism, 
we have successfully identified a ring of pixels centered at {I k, 209°, 6 « 
—57°), which seems to account for the observed detection of non-Gaussianity 
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at the wavelet scale of 5 degrees. By removing the ring from the WMAP data 
set, no more prominent deviation from Gaussianity was fonnd. Note that the 
detection is achieved in wavelet space first. Second, it is always possible that 
a fraction of pixels within the ring might deviate from Gaussianity even if 
they do not appear to be above the 99% confidence level in wavelet space. 
The location of the ring coincides with the cold spot detected in Vielva et al. 
2004 and Cruz et al. 2005. 

1 INTRODUCTION 

The Standard Inflationary model solves the horizon, the flatness, and the monopole prob- 
lems, and provides a framework for the formation of structure in the universe (Guth 1981, 
Guth & Pi 1982). Regarding the latter, the Standard Inflationary model predicts the exis- 
tence of quantum density fluctuations that are amplifled during the inflationary period and 
that grew, through gravitational instabilities, into the galaxies and clusters that populate 
our universe. These primordial density fluctuations are predicted to form a homogeneous 
and isotropic Gaussian field. The predicted statistical distribution of the Cosmic Microwave 
Background (CMB) temperature fiuctuations reflects that of the primordial density fluctua- 
tions. Testing this prediction has been the aim of many works in the literature. In particular, 
since the release of the first year of data collected by the WMAP (Wilkinson Microwave 
Anisotropy Probe) sateUite (Bennett et al. 2003), a considerable number of papers have 
presented different statistical analyses based on data in real space, spherical harmonics, and 
wavelet space (Park 2004, Eriksen et al. 2004a, Erikscn et al. 2004b, Erikscn et al. 2005, 
Larson & Wandelt 2004, Hansen et al. 2004, Prunet et al. 2005, Vielva et al. 2004, Cruz 
et al. 2005, Mukherjee & Wang 2004, Mc.Ewen et al. 2004). AU these works have claimed 
the detection of deviations from the predictions of the Standard Infiationary model in the 
WMAP data set optimal for CMB studies (see Komatsu et al. 2003). Several other works 
have presented statistical methods shown to be very powerful in detecting deviations from 
the Standard Infiationary model in the so-called Internal Linear Combination (ILC) map 
(Chiang et al. 2003, Coles et al. 2004, Copi et al. 2004, Naselsky et al. 2003, Chiang & 
Naselsky 2004). In this case the most convincing source of deviations is foreground related. 

Deviations from the predictions of the Standard Infiationary model can have a cosmo- 
logical origin. Non-Gaussianity can be generated under different conditions. A review on the 
predictions from several alternative scenarios including multi-field infiation, inhomogeneous 
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reheating, non-linearities in the gravitational potential, and the curvaton-based model can 
be found in Bartolo et al. 2004. However, these deviations can also be caused by systematic 
effects or noise associated with the experiment as well as foregrounds (Galactic or extra- 
galactic). The first year of data collected by the WMAP satellite has undergone careful 
characterization and examination by the WMAP team in an attempt to fully understand 
the data set (Page et al. 2003, Hinshaw et al. 2003, Bennett et al. 2003b, Barnes et al. 2003, 
Jarosik et al. 2003). The detection of deviations indicated in the previous paragraph show 
the presence of some asymmetry between the northern and southern hemispheres. Moreover, 
analyses in wavelet space in different regions (north, south, northeast, northwest, southeast, 
southwest of the Galactic plane) performed by Vielva et al. 2004 and Cruz et al. 2005 indi- 
cate that the source of deviations might be a cold spot located at {I 209°, b —57°). The 
nature of the observed deviations is still not clear. 

The implementation and development of new statistical methods is indispensable to im- 
prove our understanding of the deviations from the Standard Inflationary model predictions, 
in particular those observed in the WMAP data. There are many kinds of non-Gaussianity, 
and each type may be sensitive to some statistical tests but immune to others. We need 
more types of tests, such as Higher Criticism in order to better understand non-Gaussianity. 
The Higher Criticism (HC) statistic was first proposed in Donoho & Jin (2004) for testing 
a very subtle problem: given n Gaussian observations with same standard deviations but 
different means, we want to test whether n means are all zeros versus the alternative that 
a small fraction of them is nonzero. The fraction of nonzero means is too small to have 
any effect on the bulk of the observations, so all statistics based on moments (e.g. excess 
kurtosis (k,)) would generally have no power for detection. With careful cahbrations, HC 
was proved to be optimal in detecting such situations. HC is also competitive for detection 
in other scenarios. An analysis of simulated CMB on fiat patches of the sky using the HC 
statistics was presented in Jin et al. 2004. The simulated images included CMB signals, as 
well as cosmic strings and Sunyaev-Zeldovich type emission. Analyses were performed in 
wavelet and curvelet spaces (Candes & Donoho 2000). The very particular non-Gaussianity 
introduced by the simulated effects, as well as the restricted size of the sample, resulted in 
the lower sensitivity of this method in comparison with the traditional k statistic. However, 
theoretical studies indicate that the HC statistic will outperform the k when the sample 
size gets larger. 

Besides its competitive detection power, HC is also valuable in identifying the origin 

© 2001 RAS, MNRAS 000, 1-?? 



4 

of detected deviations, offering a way to determine which portion of the data contributes 
most to the deviation; this property is not available for many other statistics such as k. As 
mentioned above, analyses of the WMAP data in wavelet space suggest that the source of 
the detected deviations is due to the cold temperatures of certain pixels within a spot located 
in the southern hemisphere. The HC test is especially sensitive to abnormally high amounts 
of moderately large observations and, as shown in this paper, its application to WMAP 
data shows deviations from the predictions of the Standard Inflationary model. Moreover, 
inspection of individual pixel HC values in the WMAP data provides a direct means to 
identify the pixels at the source of the deviation. No additional region-by-region analysis is 
needed. The computational cost is therefore strongly reduced, making it a promising test 
for analyzing future higher resolution data, such as data from the Planck mission*. 

The paper is organized as follows. The HC statistical test is described in detail in Section 

2. A comparison between the performance of this statistic and that of the kurtosis (k) and 
the Maximum (Max) is included in subsection 2.1. The WMAP data is analyzed in real and 
wavelet spaces. A description of the combination of WMAP data in different channels, the 
Monte Carlo simulations, and the process followed in wavelet space is included in Section 

3. The analysis of the WMAP data is presented in Section 4. Section 5 is dedicated to 
conclusions and discussion. 

2 HIGHER CRITICISM 

The HC statistic was first proposed in Donoho & Jin 2004, Jin 2004. Given n independent 
observations of a distribution which is thought to be slightly deviated from the standard 
Gaussian, one can compare the fraction of observed significances at a given a-level (i.e. the 
number of observed values exceeding the upper-a quantile of the standard Gausian) to the 
expected fraction under the standard Gaussian assumption: 

\/n\{Fraction of Significance at a) — a\/{a{l — 

The HC statistic is then defined as the maximum of the above quantities over all significance 
levels < q; < 1. Given n individual observations Xi from a distribution which is thought to 
be symmetric and slightly deviated from the standard Gaussian, there is a simpler equivalent 
form of HC defined as follows. First, we convert the individual XiS into individual p- values: 

* http://www.rssd.esa.int/index.php?project=PLANCK 
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Pi = P{|A^(0, 1)1 > l^il}, then we let < p(2) < . . . < p(n) denote the p-values sorted in 
ascending order, define: 



HCn,i = Vn 



^Jpii){l-Pii)) 



the HC statistic is then: 



or in a modified form: 



HC*^ma;KHCnA 



HC^ — max HCn,i', 

{i: l/n^(i)<l-l/n} 

we let HCn refer either to HC* or HC:^ whenever there is no confusion. The above definition 
is slightly different from that in Donoho & Jin 2004, but the ideas are essentially the same. 

HC is useful in non-Gaussianity detection when Xj are truly from A^(0, 1), with the result 
that HCn,i is approximately distributed as A^(0, 1) for almost every i. Thus an unusually 
large HC^ or i?C+ value strongly implies non-Gaussianity. Moreover, HCn,i also provides 
localized information on deviations from Gaussianity. We can track down the source of non- 
Gaussianity by studying which portion of the data gives unusually large HCn^i. 

Previous works have claimed the detection of deviations from the predictions of the 
Standard Inflationary model in the First Year WMAP data, using the k statistic in wavelet 
space (Vielva et al. 2004, Mukherjee & Wang 2004, McEwen et al. 2004). In order to estabhsh 
a comparison between this statistic and the HC we will include calculations of k in our 
analysis as well. For completeness, we will also use the so-called Max statistic. The definitions 
for these two statistics are provided below. A discussion about the theoretical power of the 
three statistics to detect deviations from Gaussianity is inchidcd in the following subsection. 

Kurtosis. K. For a (symmetric) random variable X, k is n{X) = ^Jj|^2]j2 ~ 3, which uses 
the 4th moment to measure the departure from Gaussianity. k is useful in non-Gaussianity 
detection as k{Xi, X2, . . . , Xn) ^ i^{X) for large n. 

Max. The largest (absolute) observation is a classical statistic: 

M„ = max{\Xi\, IX2I, . . . , 

Max is useful in non-Gaussianity detection because M„ fa -\/2 logn when Xi are truly from 
A^(0, 1); thus a significant difference between and \/2 logn implies non-Gaussianity. 
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2.1 Comparison of Higher Criticism, Max and Excess Kurtosis 

The aim of this section is to estabhsh a simple theoretical comparison between the three 
statistics apphed in this paper. We show the power of the different statistical tests in detect- 
ing the distortion generated by a faint non-Gaussian signal (modeled as a function of the 
decaying rate of the tail of the distribution) superposed on a Gaussian signal. Similar to the 
analysis presented in Jin et al. 2004, the superposed image can be thought of as F = iV + G, 
where Y is the observed image, N is the non-Gaussian component, and G is the Gaussian 
component (assumed to have mean zero and dispersion one, A'"(0, 1)). We study the power 
of the three statistics in testing whether N — or not. 

One can do such a test either in real space (the space of the observations) or, as it is 
done in this paper, in wavelet space. For sufficiently fine resolution, the wavelet coefficients 
Xi of Y can be modeled as: 

Xi — Vl — A • Zi + y/X • Wi, 1 ^n, 

where n is the number of observations, 1 ^ A ^ is a parameter, Zi ~ ^'^(0, 1) are the trans- 
form coefficients of the Gaussian component G, and Wj ~ W arc the transform coefficients 
of the non-Gaussian component N. W is some unknown distribution with vanishing first 
and third order moment (this constraint is only imposed to simplify the possible range of 
non-Gaussian distributions, and it will be representative of a Gaussian distribution modified 
by a few large values contributing to a tail) . Without loss of generality, we assume the stan- 
dard deviation of both Zi and Wi are 1. Phrased in statistical terms, the problem of detecting 
the existence of a non- Gaussian component is equivalent to discriminating between the null 
hypothesis and the alternative hypothesis: 

Hi: Xi = Vl- X- Zi + VX-Wi, O^A^l. 

N = being equivalent to A = 0. 

In order to obtain a quantitative estimation of the ability of the three statistics to detect 
the non-Gaussian component, we parametrize the tail probability of as follows. 

lim a;"P||H^| > x} — Ca, Cq, is a constant. 

To model increasingly challenging situations as the number of observations increases, we 
calibrate A to decay with n as: 

A = A„ = . 
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To constrain the detectability of such a non-Gaussian distribution superposed to a Gaussian 
one, we search the r and a parameter space. We define the regions of this space that will 
be detectable under different statistical tests. It was shown in Jin et al. 2004 that there 
is a curve in the r-a plane that separates the detectable regions of parameters from the 
undetectable regions. That curve is given by: 

J 2/a, a ^8, 

[ 1/4, a > 8. 

In Figure 1, we compare the results of HC, Max and k,. When it is possible to detect, HC 
or Max are better than k when a ^ 8. k is better than HC or Max when a > 8. 

To conclude this section, we remark that the performance of HC, Max, and k depends 
on different situations of non-Gaussianity. Intuitively, Max is designed to capture evidence 
of unusual behavior of the most extreme observations against the Gaussian assumption. The 
HC statistic is able to capture unusual behavior of the most extreme observations, as well 
as unusually large amounts of moderately high observations. Thus HC is better than Max, 
in general. However, when the evidence against the Gaussian assumption truly lies in the 
most extreme observations, HC and Max are almost equivalent. In contrast, k is designed 
to capture the evidence hidden in the 4th moment. Therefore this statistic depends on the 
bulk of the data, rather than on a few extreme observations or a small fraction of relatively 
large observations. Lastly, the HC statistic ofi^ers an automatic way to find the area of the 
data accounting for the non-Gaussianity, while the Max and k, statistics do not have this 
capability. 



3 WMAP FIRST YEAR DATA AND SIMULATIONS 

The data collected by the WMAP satellite during the first year of operation is available at the 
Legacy Archive for Microwave Background Data Analysis (LAMBDA) website^. Following 
Komatsu et al. 2003, the analysis presented in this work is based on a data set obtained by 
the weighted combination of the released Foreground Cleaned Intensity Maps at bands Q,V 
and W. Each of these maps can be downloaded in the HEALpixi nested format at resolution 
nside = 512 (total number of pixels being 12 x nside^). The co-added map is obtained by 

t http://lambda.gsfc.nasa.gov/ 

t HEALPix http://www.eso.org/science/healpix/ 
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Figure 1. Detectable regions in the a — r plane. With (a, r) in the white region on the top or the undetectable region, all 
possible statistics fail asymptotically for detection. With {ce,r) in the white region on the bottom, both k and Max/HC are 
able to detect reliably. While in the blue region to the left, Max/HC is able to detect reliably, but k completely fails. In the 
yellow region to the right, k is able to detect reliably, but Max/HC completely fail asymptotically. 



The temperature at pixel i, T{i) results from the ratio of the weighted sum of temperatures 
at pixel i at each radiometer divided by the sum of the weights of each radiometer at pixel 
i. The radiometers Ql, Q2, VI, V2, Wl, W2, W3 and W4 are sequentially numbered from 
3 to 10. The weights at each pixel, for each radiometer Wr{i), are the ratio of the number of 
observations Nr{i) divided by the square of the receiver noise dispersion (To,r-This results in a 
co-added map at resolution nside = 512. This map is downgraded to resolution nside = 256 
before the analyses are performed. In order to remove Galactic and point source emission, 
we applied the most conservative mask provided by the WMAP team, the KpO mask. This 
follows the procedure first presented by Vielva et al. 2004. 

In order to detect any possible deviations from the predictions of the Standard Infla- 
tionary model we compared the values of several statistics in the WMAP data set described 
above, with those obtained from 5000 Monte Carlo simulations. The temperature at a cer- 
tain pixel i (pointing towards a direction characterized by polar angles 6i and (pi), can be 
expressed as an expansion in spherical harmonics Yifn{Oi,(j)i) 



The Monte Carlo simulations were performed assuming a Gaussian distribution A^(0, Ci) for 
the spherical harmonic coefficients aim where Ci represents the power spectrum that best 



the following combination 
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fits the WMAP, CBI and ACBAR CMB data, plus the 2dF and Lyman-alpha data. Beam 
transfer functions as well as number of observations and receiver noise dispersion were taken 
into account when simulating data taken by each of the receivers (all of these, as well as the 
best fit power spectrum are provided by the WMAP team in the LAMBDA website) . 

The analysis was carried out in both real space and wavelet space. We convolved the 
WMAP data set and the Monte Carlo simulations with the Spherical Mexican Hat Wavelet 
(SMHW) at fifteen scales, following the same procedure presented in Vielva et al. 2004 and 
Cruz et al. 2005. The scales, numbered from 1 to 15, correspond to 13.74, 25.0, 50.0, 75.0, 
100.0, 150.0, 200.0, 250.0, 300.0, 400.0, 500.0, 600.0, 750.0, 900.0, and 1050.0 arcmin. In 
order to avoid border effects caused by the mask included in the WMAP data set and the 
simulations, analyses are performed outside extended masks defined at each scale. Given a 
scale, the extended mask is the KpO mask plus pixels near the Galactic plane that are within 
2.5 times the scale. All of these extended masks were presented in Figure 2 of Vielva et al. 
2004. As discussed in the next section, deviations are detected in wavelet space. This shows 
once again the value of wavelets to provide a space in which certain features that might be 
causing deviations from the predictions of the Standard Infiationary model are enhanced. 

4 ANALYSIS 

Based on 5,000 simulations, we calculated the 68%, 95%, and 99% confidence regions for 
each of the 4 statistics (k. Max, and the two HC tests) at each of the 15 wavelet scales 
used. We used two-sided confidence regions for k, as it is symmetric about under the 
null hypothesis that the data is Gaussian. Max and HC statistics arc defined so that the 
larger the statistic, the stronger the evidence against the null hypothesis. Therefore, we used 
one-sided confidence regions for these statistics. 

Figure 2 shows the results obtained based on k. Dots denote the value of this statistic for 
the WMAP data set. Bands outhned by dashed, dotted-dashed, and solid fines correspond to 
the 68%, 95%, and 99% confidence regions respectively (symbols and fines will represent the 
same in the figures included in this paper unless a comment is added). Clearly, the results 
agree with those presented in Vielva et al. 2004. Figures 3 and 4 show the new results based 
on the Max and HC statistical tests. The WMAP data appears outside the 99% confidence 
level obtained based on the predictions of the Standard Infiationary model at scale 9 (300 
arcmin) for these three tests. In particular, 99.46% of the 5000 simulations have Max 
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Figure 2. Values of the k statistic for tlie analyzed WMAP data set (dots). The bands outlined by dashed, dotted-dashed and 
solid lines correspond to the 68%, 95% and 99% confidence regions respectively. 
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Figure 3. Values of the Max statistical test for the analyzed WMAP data set (dots). The dashed, dotted-dashed and solid 
lines correspond to the 68%, 95% and 99% confidence levels respectively. 

and HC/HC^ values below the one obtained from the WMAP data set. Therefore in this 
particular case these statistics are as competitive as the kurtosis in detecting deviations from 
the assumed null hypothesis. 



4.1 Location of the outlying pixels provided by the Higher Criticism statistic 

As discussed above, the HC statistical tests introduced in this paper are able to detect 
deviations from the Standard Inflationary model in the WMAP data at scales around 300 
arcmin. Moreover, the power of this new test resides in providing a direct way to determine 
which pixels in the WMAP data set are generating these deviations. By comparing the 
values of the HC test at each pixel, with the value that defines the 99% confidence limit at 



© 2001 RAS, MNRAS 000, 1-?? 



Higher Criticism Statistic: Detecting and Identifying Non-Gaussianity in the WMAP First Year Data 11 




5 10 15 5 10 15 



scale scale 

Figure 4. Values of the HC/HC^ statistical tests for the analyzed WMAP data set (dots). The dashed, dotted-dashed and 
solid lines correspond to the 68%, 95% and 99% confidence levels respectively. 

scale 9, we were able to extract a total of 490 pixels that are causing the detected deviation. 
Figure 5 shows the individual pixel values of the HC statistic for the WMAP data set at 
scale 9. The selected pixels above the 99% limit are plotted in the map shown in Figure 6. 
As can be seen, these pixels define a ring centered at position (/ ^ 209°, 6 ~ —57°). It is 
important to note that the correlations introduced by the convolution with the wavelet need 
to be taken into account in order to properly interpret this result. Some pixels within the 
ring could also initially (in the map in real space) deviate from Gaussianity. 

4.2 Effects of the subtraction of pixels at the source of the detection 

Removal of the pixels in the ring from the analysed WMAP data resulted in a set of data 
compatible with the predictions from the Standard Inflationary model (see Figure 7). The 
deviation observed in the k statistic at the wavelet scale of 5 degrees, in this paper as well as 
in previous papers by Vielva et al. 2004, Cruz et al. 2005, Mukherjee & Wang 2004, McEwen 
et al. 2004, disappeared as the pixels in the ring (Z ^ 209°, h ~ —57°) were removed. These 
pixels are part of the cold spot pointed out by Vielva et al. 2004 and Cruz et al. 2005 as being 
the source of the observed deviations. The Max and the HC values decrease as well (after 
removal of the pixels in the ring) fitting now within the 68% confidence region. The Higher 
Criticism statistic is useful by offering an automatic detection of the non-Gaussian portions 
of the data. This characteristic is not easily available for other tests, such as Kurtosis. 
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Figure 5. Values of the HC statistical test at every pixel of the analyzed WMAP data set. After sorting all the p-values, a 
number is assigned to each pixel i. That number goes from (pixel with the smallest p- value) to the number of data (pixel 
with the largest p- value). The HC values at each pixel are plotted against i /number of data. The region corresponding to the 
extreme values of HC is shown in the figure on the right panel. 



Figure 6. A quarter of the whole sky showing the location of the WMAP pixels with HC values above the extreme of the 
99% limit obtained from 5000 simulations. 

5 CONCLUSIONS AND DISCUSSION 

This paper presents an analysis of the compatibihty of the distribution of the CMB temper- 
ature fluctuations observed by WMAP with the predictions from Standard Inflation. The 
analysis is based on the recently developed HC statistic. This statistic is especially sensi- 
tive to two types of deviations from Gaussianity: an unusually large amount of moderately 
significant values or a small fraction of unusually extreme values. In both of these cases the 
HC statistic is optimal (Donoho & Jin 2004, Jin et al. 2004). Moreover, the definition of the 
statistic naturally suggests a direct way to locate the possible sources of no n- Gaussianity. 
Even if the wavelet transform is a linear process, the original distribution is slightly distorted 
as the scale of the wavelet, and the area covered by the mask, increase. We are working on 
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Figure 7. Values of k, M ax and HC tests for the analyzed WMAP data set after subtracting the pixels that were causing the 
deviations from the predictions of the Standard Inflationary model (dots). The bands outlined by dashed, dotted-dashed and 
solid lines correspond to the 68%, 95% and 99% confidence regions respectively for ft. The dashed, dotted-dashed and solid 
lines correspond to the 68%, 95% and 99% confidence levels respectively for Max, HC and HC^ . 

improving the power of the HC statistic to be optimal in the detection of deviations from 
Gaussianity when apphed in wavelet space. 

We compared the performance of the HC statistic with that of k and Max. The com- 
parison was based on the analysis of the first year WMAP data, in real space and in wavelet 
space. Distributions of the three statistics were built on 5000 simulations assuming the 
Standard Inflationary model predictions as well as the constraints coming from the WMAP 
observations. The three statistics provided comparable results, pointing to the presence of 
deviations at a wavelet scale of 5 degrees. We made use of the HC statistic to automatically 
identify the pixels in the WMAP data set that were causing such deviations. A ring centered 
at (/ ~ 209°, b ^ —57°) containing 490 pixels was shown to be the cause behind the detected 
deviations. Removal of this set of pixels from the WMAP data made the data compatible 
with the predictions from the Standard Inflationary model. One should be cautious when 
interpreting this result. The detection is achieved in wavelet space. Even if only the pixels in 
the ring appear as outliers of the HC distribution at wavelet scale of 5 degrees, some other 
pixels whithin the ring could be involved in the observed deviation. Convolution with the 
wavelet introduces correlations between pixels that need to be properly taken into account 
in the definition of the statistic to allow a correct interpretation. It is important to note that 
the pixels selected as the source of the observed deviations are part of the cold spot pointed 
out by Vielva et al. 2004 and Cruz et al. 2005. A careful study of the possible influence of 
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foregrounds, noise, and beam distortion and assumption of a certain power spectrum was 
carried out in tliose two papers. 

To conclude we would like to remark on the power of the HC statistic for detecting and 
locating possible sources of non-Gaussianity. In particular, regarding analysis of the CMB, 
the HC statistic can be useful at several steps in the data processing and final analysis, from 
the study of distortions caused by systematic effects in the time-ordered data to the study 
of compatibility of the statistical distribution of the observed temperature fluctuations with 
predictions from theories of the very early universe. 
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